Conformational Entropy of Compact Polymers 
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Exact results for the scaling properties of compact polymers on the square lattice are obtained 
from an effective field theory. The entropic exponent 7 = 117/112 is calculated, and a line of 
fixed points associated with interacting chains is identified; along this line 7 varies continuously. 
Theoretical results are checked against detailed numerical transfer matrix calculations, which also 
yield a precise estimate for the connective constant k = 1.47280(1). 
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Polymers are long, flexible molecules, and as such their 
conformations are well described by different types of 
random walks [Q. For example, polymers in good sol- 
vents are modeled by self-avoiding random walks which 
can be mapped to a magnetic system at the critical point. 
This mapping leads to controlled approximate calcula- 
tions for conformational exponents v and 7 in three di- 
mensions Q, and exact results in two These expo- 
nents describe the power-law dependence of the linear 
size, and of the conformational entropy, on the number 
of monomers, respectively. 

Unlike polymers in good solvents which arc swollen, 
globular proteins in their native state form compact 
structures. On the lattice, compact polymers are mod- 
eled by Hamiltonian walks, i.e., self-avoiding walks that 
visit all the sites. 

Recently, compact polymers on two and three- 
dimensional lattices have become the model of choice for 
protein folding studies |^ . Here the focus is on the effect 
of non-specific and non-local hydrophobic interactions 
among the amino acids, on the folding process, and on 
the formation of secondary structure (helices and sheets) . 
These investigations have been almost exclusively numer- 
ical, and an analytical theory of protein conformations 
that takes into account self-avoidance and compactness, 
as well as specific sequence information, would be of con- 
siderable interest . As a first step towards this goal we 
construct a field theory of compact polymers on the two- 
dimensional square lattice which provides detailed and 
exact information about their conformational statistics. 

Conformational exponents for compact polymers on 
the honeycomb lattice were calculated recently using 
Bethe Ansatz techniques [Q . Since there is a certain de- 
gree of frustration associated with the Hamiltonian con- 



straint one might expect the exponents to change from 
one type of lattice to another. Namely, the number 
of contacts, i.e., monomer pairs which are not adjacent 
along the chain but occupy nearest neighbor positions 
on the lattice, is one per monomer on the honeycomb, 
and two per monomer on the square lattice. Since hy- 
drophobic interactions in lattice models of proteins occur 
at contacts, the square and honeycomb problem describe 
different physical situations. 

Here we present for the first time exact results for com- 
pact polymers on the square lattice. From numerical 
transfer matrix calculations it was already observed that 
7 — 1.0444(1) [H differs from the honeycomb value 7 = 1; 
the latter was also the outcome of a mean-field calcula- 
tion je). We find 7 = 117/112 = 1.0446 . . .. Furthermore, 
we describe a line of fixed points associated with inter- 
acting compact polymers, along which 7 varies continu- 
ously. These results are in excellent agreement with our 
numerical transfer matrix calculations, which also pro- 
vide a very precise estimate of the connective constant, 
K — 1.47280(1). This constant describes the leading, ex- 
ponential scaling of the number of compact polymer con- 
formations (C) with the number of monomers: C ~ k^. 

Our identification of compact polymers with a criti- 
cal model also has bearings on the cooperativity of pro- 
tein folding thermodynamics 1^. Namely, since there is 
no energy gap separating the first excited (non-compact) 
state from the native (compact) ones, at least in the large 
chain limit, we conclude that homopolymer collapse is a 
one-state process Q . 

a. Loop model To construct a field theory of compact 
polymers we make use of the two-fiavor fully packed loop 
(FPL^) model on the square lattice QJ^. It plays the role 
of the 0{n) loop model B from which exact results for 
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swollen and dense polymers were derived. The polymer 
problem is recovered from the loop model in the limit of 
vanishing fugacity for one of the loop flavors. 

The allowed loop configurations (Q) of the FPL^ model 
are defined by drawing (black) loops along the bonds of 
the square lattice with the constraint that each site is 
covered by exactly one loop. Loops are not allowed to 
cross and each is assigned a fugacity rib. The bonds not 
covered by loops also form (gray) loops whose number 
(TVg) is not constrained by the number of black loops 
(TVb). Gray loops do not intersect, and each is assigned 
a fugacity rig. The partition function of the FPL^ model 
is 

^ = E«^- (1) 

g 

The FPL^ model is the first loop model studied to 
date that possesses a two-dimensional manifold of criti- 
cal fixed points, for < nb,ng < 2 rib ^ is the 
compact polymer problem. From here on we focus on 
the critical region of the phase diagram, for which exact 
results were previously obtained only along the line of 
special symmetry, — Ug . 

b. Height model For the FPL^ model we construct an 
effective field theory by mapping it to an interface model. 
The basic idea is to interpret the loops as contour lines of 
a height . First we orient the loops independently and 
randomly, so as to be able to decide in which direction 
the height increases. Given an oriented loop configura- 
tion the microscopic heights z are defined at the centers of 
the lattice plaquettes. Each bond is in one of four states, 
labeled by vectors A, B, C or D, depending on its flavor 
and direction. Starting from an even site an oriented 
black (gray) loop is defined as a sequence ABAB . . . 
(CDCD . . .) of bond states. The increase in z, when 
going clockwise around an even site, is A, B, C or D de- 
pending on the state of the bond being crossed. The fully 
packing constraint implies that all four bond states are 
represented at every site, hence, in order for the height 
to be well defined, we must have A-l-B + C + D — 0. The 
four vectors thus span a three-dimensional vector space 
which is the space of heights. We adopt the normaliza- 
tion of Ref. §: A = (-1, +!,+!), B = 
C = (-f,-f,-f), and D = (+l,-f,+f) 

To complete the mapping from loops to heights we 
must specify the way in which the fugacities rib and rig are 
distributed between the two possible orientations. The 
clockwise oriented black loops are assigned the weight 
exp(i7reb), and similarly for the gray loops with eg re- 
placing eb. The anti-clockwise loops are weighted with 
the opposite phase. This way, summing over the two 
orientations, for any given loop produces the original fu- 
gacities: 

rib = 2cos(7reb) , rig = 2cos(7reg) . (2) 



The reason for choosing to redistribute the loop fu- 
gacity in this fashion is that it allows for a local defini- 
tion of oriented loop weights, which ultimately leads to 
a local effective field theory. In particular, if we assign 
to every right turn of an oriented black loop the weight 
Ab — exp(i7reb/4), and X^^ for a left turn, then the whole 
loop will be weighted correctly, since the difference in the 
number of right and left turns for a closed loop on the 
square lattice is ±4. The rule for the gray loops is the 
same with Cg replacing Cb . This assigns to each vertex of 
the square lattice a weight A(x) which is the product of 
the local weights associated with the oriented black and 
gray loop passing through x. 

Once the height at the origin is fixed, oriented loop con- 
figurations are in a one to one correspondence with height 
configurations which acquire their weights. Coarse- 
graining of the microscopic heights produces the height 
field h(x) whose fluctuations are described by a confor- 
mally invariant Liouville field theory. 

c. Liouville theory The effective field theory for the 
coarse grained heights is given by the Euclidean action 

S = 5'e + Sb + >5'l (3) 
where each of the three terms 

a/i" • dhl^ (4) 

^^^i^J (^'o ■ ^) ^ 

Sl^ fd^x Weexp(ie-h(x)) , (6) 

has a concrete geometrical interpretation. 

The first, elastic term, accounts for the entropy of ori- 
ented loop configurations. The symmetries of the ori- 
ented loop model impose constraints on the stiffness ten- 
sor IK: i^ii = K33 and K12 = K23 = ^. We thus find 
three elastic constants which are not related by symmetry. 
This is in contrast to all previously solved loop models 
which are characterized by a single coupling constant . 

The second, boundary term, describes the coupling 
of the height field to the scalar curvature TZ. We are 
only concerned with the FPL^ model defined on fiat 
lattices for which TZ is zero everywhere except possi- 
bly at the boundary. For example, on the cylinder 
TZ = 47r[(5(oo) — (5(— 00)], and has the effect of in- 
serting vertex operators exp(±i7reo ■ h) at the two far 
ends. These vertex operators supply winding loops with 
the weight exp[i7reo • (h(oo) — h(— 00))], since these are 
the only loops that contribute to the height difference 
between the two far ends. This extra phase factor is nec- 
essary, for a winding loop has an equal number of left 
and right turns, implying that the local vertex weights 
would assign it a total weight of 1, regardless of direction 
or flavor. For Sb to correct this, the background charge 
must be 
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The third term, the so-caUed Liouville potential, is 
the coarse-grained version of the microscopic weight, 
JI^A(x), assigned to an oriented loop configuration. If 
we define w(x) = — ln(A(x)), then the operator w(x) is 
invariant under translations in height space that form the 
bcc lattice TZw- As such it can be expanded in a Fourier 
series, Eq. (H), where the electric charges e take their 
values in the reciprocal lattice 7^^; 72.^ is an fee lattice 
with a conventional cubic cell of side 27r . 

The three elastic constants appearing in the action 
completely determine the scaling dimensions of all op- 
erators constructed from the height. Apart from the 
afore-mentioned vertex operators there are also defect 
operators which correspond to vortex configurations of 
the height. Vertex operators are defined by the electric 
charge e while defect operators are characterized by the 
magnetic charge m, which is the height mismatch around 
the vortex core. The scaling dimension of a general op- 
erator with both electric and magnetic charge is 



x(e, m) = — e • [IK 
47r 



(e - 2eo)] + -^m • [IK • m] ; (8) 

47r 



IK is the stiffness tensor in Eq. (^. 

To calculate the three elastic constants that make up 
IK we turn to the loop ansatz which states that the 
Liouville potential is marginal, i.e., the most relevant (in 
the RG sense) vertex operators appearing in the sum 
in Eq. have dimension x{e, 0) = 2. There are four 
such vertex operators among the twelve shortest vectors 
in TZ^: (-7r,0, ±7r) and (-7r,±7r,0). Using Eq. (|) this 
leads to four equations for the three elastic constants with 
the unique solution: 



Kii = - (2 - eb - eg). 



g (eb - eg), 

TT (l-eb)(l-eg) 
2 2 - Cb - e,. 



(9) 



d. Conformational exponents Points in the critical 
region of the FPL^ model are characterized by the cen- 
tral charge and the geometrical scaling dimensions Xs^^^s^, 
which are both functions of the loop fugacities and 
Ug. The central charge provides information about the 
finite-size corrections to the free energy, while the scal- 
ing dimensions are defined by the asymptotic relation 
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, (0, r) is the partition 



function of the FPL^ model with the constraint that there 
are Sb black and Sg gray strings connecting points and 
r; see Fig. ^ We consider the two end points to be in the 
bulk, in which case Sb -t- Sg is necessarily even; the odd 
case is associated with boundary operators M . 




b) 




FIG. 1. Defect configurations used for calculating geo- 
metrical scaling exponents xi^i (a) and X2,o (b). Bond states 
around point 0, listed clockwise from the leftmost bond, are 
DC AC (a) and AACD (b). 

The central charge of the Liouville field theory is [Eo| 



c = 3 + 12a;(eo,0) = 3-6 



1 - Cb 1 - e„ 



(10) 



In Table || we compare this formula with numerical trans- 
fer matrix calculations along the compact polymer line 
(rib = 0, eb = 1/2) and excellent agreement is found. 
This holds true for the whole critical region [^ . 

Here we are interested in the conformational exponents 
7 and v for compact polymers [|ll|, and so it suffices to 
calculate the geometrical scaling dimensions xi^i and X2fl 
[|l2|. The formula for the whole spectrum a^sb.sg is a sim- 
ple generalization of the calculation presented below 

To calculate xi^i we consider the diagram in Fig. Ill 
which represents an FPL^ configuration with a single 
black and a single gray string connecting the two points 
separated by r. In the height representation these con- 
figurations are associated with topological defects with 
charges ±mi^i placed at the two points; mi^i = C — B = 
(—2, —2,0) is the net height change upon encircling the 
point 0. Since an oriented segment is weighted by a com- 
plex phase, whose value depends on the number of times 
the string winds around and r, vertex operators with 
charge Gq have to be inserted at both points in order to 
weight all segments equally ||]. Therefore, xi^i is the di- 
mension of an operator with total charge (eg, mi_i), and 

2x1,1 = i[(l-eb) + (l- eg)] 



(l-eb)(l-eg) 
(l-eb) + (l-eg) 



1 - eb 



(11) 



follows from Eqs. |g and ^. 

Similarly the diagram Fig. ^ leads to the result: 

2x2,0 = 2x(eb, m.2,2) = (1 - eb) 



1 - eb 



(12) 
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In this case point serves as a source of two black 
strings, and therefore corresponds to a topological de- 
fect of strength m.2.2 = A — B = (-2,0,2). The electric 
charge eb = — 7r/2(eb, 0, — eb) now compensates for the 
extraneous phase factors due to the windings of the black 
loop segments only, since no gray segments connect the 
two points. 

Our main results follow from Eqs. (pl|)"(p^) and the 
scaling relations 7 — 1 — and 1/v = 2 — X2.o For 
compact polymers 7 ~ 117/112 is obtained by setting 
eb = 1/2 (rib = 0) and Cg = 1/3 (rig = 1) in Eq. ([u]). 
The fact that a; 1.1 < indicates an effective repulsion 
between the chain ends. This is to be contrasted to the 
mean- field result 7=1, also found for compact poly- 
mers on the honeycomb lattice Q , which implies that the 
chain ends are uncorrelated. Hitherto numerical studies 
based on enumerations of short compact polymer confor- 
mations have failed to provide the accuracy needed 
to distinguish 7 from its mean-field value. 

Examination of Eq. (|ll|) reveals a novel feature of com- 
pact polymers on the square lattice. Namely, allowing rtg 
to take different values along the nb = line represents a 
situation where different conformations arc weighted dif- 
ferently depending on the number of gray loops present. 
In the critical region of the FPL'^ model each of these 
weighted compact polymer problems defines a different 
critical geometry characterized by a continuously vary- 
ing 7. Similar behavior was predicted for directed self- 
avoiding walks with orientation dependent contact inter- 
actions 0. 

Finally, from Eq. ( p^ ) we obtain the conformational 
exponent v = 1/(2 — X2,o) — 1/2, independent of rig. 
This result serves as a nice consistency check on our the- 
ory since compact structures have Hausdorff dimension 
D — 2 regardless of how they are weighted, and f = 1 /D. 

e. Numerical results To check our results we have 
constructed transfer matrices in a connectivity basis anal- 
ogous to that of the 0(n) model jl^, but taking into 
account the additional flavor information of the FPL^ 
model The various sectors containing 0, 1 and 2 
strings were considered for strip widths up to L = 14. 
Conformal invariance was used to relate the finite-size 
corrections of the eigenvalue spectra to the central charge 
and various geometrical scaling dimensions. Results 
along the compact polymer line are shown in Table |; 
agreement with theory is excellent, apart from discrep- 
ancies for xi.i at Tig = 2 which we attribute to logarith- 
mic corrections. An extrapolation scheme based on the 
exact values of c, Eq. (^0|), yielded very precise estimates 
of the connective constant k. Like 7, k for n„ = I dif- 
fers slightly from its mean-field value 4/e = 1.4715 ... [p|, 
revealing the entropic origin of the effective repulsion be- 
tween chain ends. Furthermore, k changes continuously 
with the interaction-like parameter rig. 





0.0 


0.5 


1.0 


1.5 


2.0 


c 


-3.004(5) 


-1.815(3) 


-0.998(2) 
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Xl,l 


-0.2500(3) 


-0.1313 (9) 


-0.0439(9) 


0.0255(9) 


0.0839(5) 
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-0.25 


-0.1323. . . 


-0.0446. . . 


0.0260. . . 


0.1042. . . 
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0.0000(0) 


0.0000(0) 


0.0000(0) 


0.0000(0) 


0.0000(0) 


(12) 

















K 


1.41422(2) 


1.44477(1) 


1.47280(1) 


1.49896(1) 1.52371(1) 



TABLE I. Numerical results for the central charge c, geo- 
metrical exponents xi.i and 2:2,0, and the connective constant 
K along the line rih — 0. Comparison is made to predictions 
from Eqs. (10), (11) and (12). 

In conclusion, we have constructed a field theory of 
compact polymers on the square lattice from which exact 
results regarding their scaling properties were obtained 
for the first time. We hope that this might serve as a first 
step towards an analytic theory of simple lattice models 
of globular proteins ^ . 
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